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We propose a new extension of higher-order pushdown automata, which allows to use an infinite 
alphabet. The new automata recognize languages of data words (instead of normal words), which 
beside each its letter from a finite alphabet have a data value from an infinite alphabet. Those data 
values can be loaded to the stack of the automaton, and later compared with some farther data values 
on the input. Our main purpose for introducing these automata is that they may help in analyzing 
normal automata (without data). As an example, we give a proof that deterministic automata with 
collapse can recognize more languages than deterministic automata without collapse. This proof is 
simpler than in the no-data case. We also state a hypothesis how the new automaton model can be 
related to the original model of higher-order pushdown automata. 

1 Introduction 

Already in the 70's, Maslov ( lfl2l \13\ ) generalized the concept of pushdown automata to higher-order 
pushdown automata and studied such devices as acceptors of string languages. In the last decade, re- 
newed interest in these automata has arisen. They are now studied also as generators of graphs and trees. 
Knapik et al. ifTTTl showed that the class of trees generated by deterministic level-?! pushdown automata 
coincides with the class of trees generated by safe level-?! recursion schemes (safety is a syntactic re- 
striction on the recursion scheme), and Caucal (H gave another characterization: trees on level n + 1 
are obtained from trees on level n by an MSO-interpretation of a graph, followed by application of un- 
folding. Carayol and Wohrle Q studied the e-closures of configuration graphs of level-?! pushdown 
automata and proved that these graphs are exactly the graphs in the n-th level of the Caucal hierarchy. In 
order to deal with non-safe schemes, Hague et al. [8] extended the model of level-?! pushdown automata 
to level-« collapsible pushdown automata by introducing a new stack operation called collapse. They 
showed that the trees generated by such automata coincide exactly with the class of trees generated by 
all higher-order recursion schemes and this correspondence is level-by-level. Nevertheless, for a long 
time it was an open question whether safety implies a semantical restriction to recursion schemes. It was 
proven recently [14], and the proof is technically rather complicated. 

We extend the definition of higher-order pushdown automata by allowing infinite alphabets, both on 
input and on stack. More precisely, every symbol consists of two parts: a label from a finite alphabet, 
and a data value from an infinite alphabet. The part from the finite alphabet is treated as normally. On 
the other hand all the data values are assumed to be equivalent; the automaton can only test whether two 
data values are equal (data values from the input can be stored on the stack, and afterwards compared 
with some future data values on the input). Words over such a two-part alphabet are called data words. 
We consider both collapsible and non-collapsible variant of the automata. 

Data words are widely considered, for example in the context of XML processing. In XML docu- 
ments we have tag names and attribute names (which are from a finite alphabet), but we also have a text 
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data; typically we don't want to analyze the internal structure of such a text, we just want to compare 
texts appearing indifferent places in the document (or to compare them with some constants). Several 
automata models and logics were already considered for data words, see iflol for a survey. Some of the 
automata models are generalizations of finite automata to data words ifTOll . other J4l[3l are designed to 
cover some logics, like FO 2 or XPath. A model of pushdown automata (of level 1) using an infinite 
alphabet (very similar to our model) was considered in 0. (Historically the first models of pushdown 
automata over infinite alphabet were presented in O and (9 ] ; they simply allow infinitely many rules and 
infinitely many transition functions, one for each letter in the alphabet). We are not aware of any works 
which consider infinite alphabets for higher-order pushdown automata. 

The main technical content of this paper is a proof that deterministic collapsible pushdown automata 
with data recognize more languages than non-collapsible ones, as stated below. 

Theorem 1.1. There exists a data language recognized by a deterministic collapsible pushdown automa- 
ton with data of the second level, which is not recognized by any deterministic higher-order pushdown 
automaton with data (without collapse) of any level. 

The proof of this theorem is extracted from the analogous proof for automata without data, contained 
in lTT4l . Although we consider a more complicated model of automata, the proof is in fact simpler. The 
reason is that when having data values, we can precisely trace where this data value is stored in the 
stack. We just need to analyze how this places are related for different data values, hence in which order 
the data values can be recovered from the stack. The idea for automata without data is similar, but its 
realization is much more complicated. Instead of considering data values, we consider blocks containing 
the same letter repeated very many times. We look in which place of the stack the automaton "stores" the 
number of these repetitions. However now this place cannot be easily identified. Moreover, it is difficult 
to change the number of repetitions in one place, so that the whole run stays similar. This problems 
disappear when we have data values. Thus our paper can be rather seen as an illustration of the original 
proof. 

We also state a hypothesis how the new classes of automata are related to the original one. Namely, let 
us consider the (already mentioned) encoding of data words into normal words, in which a data value is 
represented as a number of repetitions of a symbol. Of course every data word has many encodings, as a 
data value may be mapped to an arbitrary number of repetitions. Assume that we have an automaton (for 
normal words) which either accepts all representations of a data word, or none of them. Our hypothesis 
would be that then we can create an automaton with data, which recognizes the corresponding language 
of data words. One can easily see that the opposite implication holds: an automaton with data can 
be converted to a normal automaton recognizing all representations of data words in the language (the 
number of repetitions of a symbol on the input is stored on the stack as a number of repetitions of some 
stack symbol). 

If this hypothesis would be proved to be true, our Theorem 11.11 would imply the original theorem of 
|[l4l that collapsible pushdown automata (without data) recognize more languages than non-collapsible 
ones (the representation of the data word language from Theorem 11.11 would give a correct example 
language for normal words). 

A next goal would be to prove analogous result for nondeterministic automata, i.e. that nondeter- 
ministic collapsible pushdown automata with data recognize more languages than non-collapsible ones. 
Notice that this is not the case for level 2 [1], but is believed to be true for higher levels. It looks likely 
that this can be shown for automata with data, much easier than for automata without data. 
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2 Preliminaries 

In this section we define higher-order pushdown automata with data. 

Let us fix any infinite set of data values B, e.g. B = N. For a finite alphabet A, a data word over A is 
just a word over A x B. For a data word w, by n(w) we denote the (normal) word over A obtained from 
w by projecting every letter to its first coordinate. 

For any alphabet T (of stack symbols) we define a k-th level stack (a k-stack for short) as an element 
of the following set r*: 

r° = Tx (BU {no-data}), 

ri = (r k - l y fori<K«. 

In other words, a 0-stack contains a symbol from T and possibly a data value (or no-data if no data value 
is stored), and a &-stack for 1 < k < n is a (possibly empty) sequence of (k — l)-stacks. Top of a stack is 
on the right. The size of a &-stack is just the number of (k — 1) -stacks it contains. For any s k 6 and 
s k ~ l € r^ 1 we write s k : s k ~ l for the /c-stack obtained from s k by placing s k ~ l at its end; the convention 
is that s 2 : s l : s° = s 2 : (s l : s°). We say that an rc-stack s is well formed if every /c-stack in s is nonempty, 
for 1 < k < n (including the whole s). In configurations of our automata we allow only well formed 
stacks. 

An n-fh level deterministic higher-order pushdown automaton with data («-HOPAD for short) is a 
tuple = (A,F,Yj,Q,q],F,A) where A is an input alphabet, T is a stack alphabet, // € T is an initial 
stack symbol, Q is a set of states, qj G Q is an initial state, F C Q is a set of accepting states, and 
ACQxrx(AU {e}) x Q x OP£ is a transition relation, where OP^ contains the following operations: 

• pop k , where 1 < k < n, and 

• push^, where 1 < k < n, and a € T. 

The transition relation A should be deterministic, which means that it is partial function from QxTx 
(A U {e}) to Q x OPy, which moreover is never simultaneously defined for some (q,a,a) £ Q x T x A 
and for (q,a,e). The letter n is used exclusively for the level of pushdown automata. 

A configuration of g/ consists of a state and of a well formed stack, i.e. is an element of Q x T" 
in which the «-stack is well formed. The initial configuration consists of the initial state qj and of the 
rc-stack containing only one 0-stack, which is (//, no-data). 

Next, we define when c \-^ a ^ d , where c,c' are configurations, and (a,d) E (A x B) U {(£, no-data)}. 
Let p be the state of c, q the state of c' , and let {a,d') be the topmost 0-stack of c. We have c \-( a < d ') c' if 

• (p, a,a,q, pop k ) G A, and the stack of c' is obtained from the stack of c by replacing its topmost 
&-stack s k : s k ~ l by s k (i.e. we remove the topmost (k — l)-stack), and either d = no-data or d = d', 
or 

• (p,a,a,q, push^), and the stack of c' is obtained from the stack of c by replacing its topmost k- 
stack / : by (/ : s k ^ 1 ) : s k ^ 1 , and then by replacing its topmost 0-stack by (B,d) (i.e. we copy 
the topmost (k — l)-stack, and then we change the topmost symbol in the copy)[[J 

1 In the classical definition the topmost symbol can be changed only when k = 1 (for k > 2 it has to be s° = ?°). We make this 
extension to have an unified definition of push* for every k. However it is easy to simulate an automaton using one definition 
by an automaton using the other in such a way that the same language is recognized. 



Pawel Parys 



213 



Notice that we require that the stack of d is well-formed, so a pop^ operation cannot be executed if the 
topmost fc-stack has size 1. Observe that every push operation sets the data value in the topmost 0-stack 
to the just read data value, or to no-data if performing an £-transitionH On the other hand a pop operation 
can read a data value only if it is the same as the data value in the topmost 0-stack; however 0-stacks 
with data values can be also removed without reading anything. 

A run R, from cq to c m , is just a sequence cq \- X[ c\ h* 2 • • • H Xm c m . We denote R(i) := c,- and call 
\R\ := m the length of R. The subrun R\ t j is c, h t,+1 c !+ i \~ Xi+1 ■■■ \~ Xj cj. If a run R ends in the first 
configuration of a run S, we denote by R o S the composition of R and S which is defined as expected. 
The data word read by R is the data word over A obtained from X1X2 . . .x m by dropping all appearances 
of (e, no-data). A data word w is accepted by if it is read by some run from the initial configuration 
to a configuration having an accepting state. 

Collapsible «-HOPAD. In Section [3] we also use deterministic collapsible pushdown automata in- 
CPAD for short). Such automata are defined like rc-HOPAD, with the following differences. A 0-stack 
contains now three parts: a symbol from T, a data value from D, and n natural numbers. The natural 
numbers are not used to determine which transition can be performed from a configuration. Every 
push operation stores in the topmost 0-stack a tuple of n numbers (k\,k2, ■ . . ,k n ), where ki is the size 
of the topmost /-stack after this operation. We have a new operation collapse' for 1 < i < n. When it 
is performed between configurations c and d , then the stack of d is obtained from the stack of c by 
removing its topmost (i — 1) -stacks, so that only k{ — 1 of them are left, where kj is the i-th number stored 
in the topmost 0-stack of c (intuitively, from the topmost /-stack we remove all (/ — 1) -stacks on which 
the topmost 0-stack is present)!! 

3 The Separating Language 

In this section we define a data language U which can be recognized by a 2-CPAD, but not by any n- 
HOPAD, for any n. It is a data language over the alphabet A = {[,], $}. The language U contains the data 
words w over the alphabets = {[,],$} which satisfy the following conditions: 

• w contains exactly one dollar, and 

• w after removing the dollar is a well-formed bracket expression, i.e. in each prefix of w the number 
of closing brackets is not greater than the number of opening brackets, and in the whole w these 
numbers are equal, and 

• the suffix of w after the dollar is symmetric to some prefix of w (i.e. the Z-th letter from the begin- 
ning and the i-th letter from the end have the same data value, and one of them is ] and the other 
D- 

Notice that words in the language can be divided into three parts, the last of them preceded by a dollar. 
The first part is completely symmetric to the third part, while the second part is an arbitrary bracket 
expression. The first part ends on the last opening bracket which is not closed before the dollar (or this 
part is empty if all brackets are closed). 

2 In nondeterministic automata we should allow also e-transitions performing a push operation which put an arbitrary 
(guessed) data value in the topmost 0-stack. This is of course forbidden in deterministic automata. 

3 Again, a classical definition of the links stored and of collapse differs slightly, but this does not change the class of 
recognized languages. 
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Let us now show briefly how U can be recognized by a 2-CPAD. The 2-CPAD will use 4 stack 
symbols: [, ], X (used to mark the bottom of 1 -stacks), Y (used to count open brackets). The initial 
symbol is X. The automaton first performs push 2 (X). Then, for every bracket a on the input, 

• we perform push 1 (a) (we place the bracket on the stack, together with the data value), 

• we perform push 2 and pop 1 , 

• if a = [ we perform push^F), otherwise pop 1 . 

When at some moment the topmost symbol is X and a closing bracket is read, the word is rejected (the 
number of closing brackets till now is greater than the number of opening brackets). After reading a word 
of length k, we have k + 2 1-stacks. For 1 < i < k, the topmost symbol of the (i + 1 )-th 1-stack is equal 
to the i-th symbol on the input (including the data value); additionally the number of the Y symbols on 
the (i + 2)-th 1-stack is equal to the number of opening brackets minus the number of closing brackets 
among the first i letters of the input. 

Finally the dollar is read. If the topmost symbol is X, we have read as many opening brackets as 
closing brackets, hence we should accept immediately. Otherwise, the topmost Y symbol corresponds to 
the last opening bracket which is not closed. We execute the collapse operation. It removes the 1-stacks 
containing all the brackets read after this bracket. Now we repeat pop 2 until only the first 1-stack is left 
(which has X as its topmost symbol), and during each pop 2 we read appropriate bracket from the input 
(the automaton ensures that the data value on the input is the same as the data value on the stack, hence 
the same as in the first part of the input). 

In the remaining part of the paper we prove that any «-HOPAD cannot recognize U ; in particular all 
automata appearing in the following sections does not use collapse. 

A proof for level 1. In order to show intuitions how the proof of Theorem 11.11 works, we present it 
first for level 1. Thus consider a 1-HOPAD which recognizes U. We give him a data word w with 
7t(w) = [ ] N and with every data value different, where N is a big enough number (namely, a number 
greater than the number of states). 

Notice that the unique run of srf on w is deterministic) stores all data values of w on the stack, 
and none of them can be removed from the stack until the end of w is reached. Indeed, we can afterwards 
give [$ to the automaton, and he has to reproduce all data values of w (the only suffix after which srf 
should accept contains all the data values of w). 

Let s be the stack after reading the part [ N . Now assume that a dollar appear after some number k 
of closing brackets (i.e. we have [ N ] k $ instead of [ N ] N ). Just after the dollar, the stack of stf still begins 
with s. As the data values from the [ N part appear only in s, we have to reach a configuration (qk,s) for 
some state q^. As we have not enough states, for two numbers of closing brackets k,k', the state will be 
the same, i.e. q^ = q^. Afterwards, the unique accepting run from (qk,s) and from {qy,s) is the same, 
in particular the same subset of data values appearing in s is read. This is not what is expected by the 
language U, as in one case the first N — k data values of w should be read, and in the other case the first 
N — k' of them. We have a contradiction. 

For higher levels the overall idea will be the same. In the next section we will define upper runs and 
returns. In our level- 1 example, the part of the run which does not modify s will be an upper run, and the 
part which analyzes s will correspond to a return. Next we will say that a stack can be accessed only in 
a constant number of ways (like here we could only visit it for each of the states). These ways will be 
described by "run descriptors" in Section|5J for each of these ways we can recover only one subset of the 
data values stored in the stack. Finally, in Section [6l we conclude the proof. 
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Table 1 : Stack contents of the example run, and subruns being &-upper runs and ^-returns 
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stack of R(j) 


{i:R\ij 0-upper} 


{i:R\ u 1-upper} 


{i : R\ t j 1-return} 





R\ Lj 2-return} 





[ab] [cd] 


{0} 


{0} 
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[ab] [cd] [ce] 


{0,1} 


{0,1} 










2 


[ab][cd][c] 


{2} 


{0,1,2} 


{0,1} 
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[ab] [cd] 


{0,3} 


{0,3} 





{1 


2} 
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[ab][c] 


{4} 


{0,3,4} 


{0,3} 
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[ab] [cd] 


{4,5} 


{0,3,4,5} 










6 


[ab][c] 


{4,6} 


{0,3,4,5,6} 


{5} 








4 Upper runs and returns 

In this section we define two important classes of runs, called upper runs and returns, and we show how 
they decompose. 

A run R is called k-upper, where < k < n, if the topmost &-stack of /?(|/?|) is a copy of the topmost 
&-stack of R(0), but possibly some changes were made to it. This has to be properly understood for k = 0: 
we assume that a push operation makes a copy of the topmost 0-stack, and then modifies its contents; in 
particular a run which just performs push is 0-upper. Notice that every run is «-upper, and that a fc-upper 
run is also /-upper for / > k. 

A run R is called a k-return, where 1 < k < n, if 

• the topmost (k — l)-stack of /?(|/?|) is obtained as a copy of the second topmost (k — l)-stack of 
R(0) (in particular we require that the topmost &-stack has size at least 2), and 

• while tracing this copy of the second topmost (k — l)-stack of R(0) which finally becomes the 
topmost (k — l)-stack of it is never the topmost (k — l)-stack of R(i) for all i < \R\. 

Notice that a run R is a ^-return exactly when the topmost /c-stack of /?(|/?|) is obtained from the topmost 
&-stack of R(0) by removing its topmost (k — l)-stack, and this removing was done in the last step of R. 
In particular every ^-return is &-upper. 

Example 4.1. Consider a HOPAD of level 2. In this example we omit data values; brackets are used to 
group symbols in one 1 -stack. Consider a run R of length 6 which begins with a stack [aA][c<i], and the 
operations between consecutive configurations are: 

push 2 (e), pop 1 , pop 2 , pop 1 , push 1 ^), pop 1 . 

Recall that our definition is that a push of any level can change the topmost stack symbol. The contents 
of the stacks of the configurations in the run, and subruns being fc-upper runs and ^-returns are presented 
in Table [TJ Notice that R is not a 1-return. 

It is important that returns and upper runs can be always divided into smaller fragments which are 
again returns and upper runs. More precisely, from lfT4l we have the following characterizations of k- 
returns and k-upper runs (these decompositions can be obtained quite easily by analyzing all possible 
behaviors of an automaton). 

Proposition 4.2. A run R is an r -return (where 1 <r <n) if and only if 
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1. \R\ = 1, and the operation performed by R is pop r , or 

2. the first operation performed by R is pop* for k < r, or push* for k^r, and R \\.\r\ is an r-return, 
or 

3. the first operation performed by R is push*/or k > r, and R \im is a composition of a k-return and 
an r-return. 

Proposition 4.3. A run R is k-upper (where <k <n) if and only if 

1. R performs only operations of level at most k, or 

2. \R\ = 1, and the operation performed by R is push r for any r>k+ 1, or 

3. the first operation performed by R is push r for r>k+\, and R\\\r\ is an r-return, or 

4. R is a composition of two nonempty k-upper runs. 

5 Types of Stacks 

In this section we assign to each &-stack a type from a finite set which, in some sense, describes possible 
returns and upper runs from a configuration having this &-stack as the topmost &-stack. Additionally, we 
also define a set of data values which can be extracted from a &-stack; the idea is that we can extract 
either none or all data values contained in such a set. 

For this section we fix an rc-HOPAD srf with stack alphabet T, state set Q, and input alphabet A. 
Moreover we fix a morphism (p: A* — ^ M into a finite monoid. In our application the monoid will be 
testing whether the word is empty, or is a single closing bracket, or begins with a dollar, or is any other 
word; however the results of this section work for any monoid. For a run R we write (p (R) to denote the 
value of <p on the word read by R (more precisely cp is applied only to the first coordinates of the letters 
of this word). 

5.1 Returns 

We begin by defining types which describe returns. In the next subsection we indirectly use the same 
types to describe upper runs; this will be possible because upper runs consist of returns (as described by 
Proposition I4.3I ). To every &-stack s k (where <k<n) we assign a set type(s*) C £T k ; it contains some 
run descriptors. The sets 3? k of all possible run descriptors are defined inductively as follows (where 
£P(X) denotes the power set of X): 

ST k = {ne} U {&>{ST n ) x @>(3? n - 1 ) x • • • x &>(^ k+l ) xQx@ k ), where 

n 

9 k = |J Mx{r}x0>(^ n )x^(^ n - l )x---x^(^ r+l )xQ. 

r=k+l 

Recall that Q is the set of states, and M is the finite monoid used to distinguish classes of input words. 

Before we define types, let us describe their intended meaning. We say that a run R agrees with 
(m,r,I, n ,'L n -\...,I, r+l ,q) € #if 

• (p(R) = m, and 

• R is an r-return, and 

• R ends in a stack t" : t n ~ l : • • • : f such that £' C type(f ! ) for r + 1 <i<n, and 



Pawel Parys 



217 



• the state of R(\R\) is q. 

The acronym ne stands for nonempty; we have ne G type(s*) when s k is nonempty. A typical run de- 
scriptor in 8? is of the form a = ( x ¥ n , x ¥ n ~ 1 ,..., x ¥ +1 ,p,S). By adding a to the type of some s k , we 
claim the following. If for each k + 1 < i <n we take an /-stack t' that satisfies the claims of x i J1 , then 
there is a run which starts in state p and stack t n : t n ~ x t k+1 : s k , and agrees with a. In other words 
we have the following lemma. 

Lemma 5.1. Let <k <n, letd G ^ , and Zef c = (p, s" : .s" -1 : ■■■ :s k )be a configuration. The following 
two conditions are equivalent: 

1. there exists a run from c which agrees with o, 

2. type(/) contains a run descriptor ( x ¥ n , x ¥"~ l ,. . . j^i 1 ^ 1 ,p,3) such that^" C type(s') for k+ 1 < 
i < n. 

Moreover, to every /c-stack s k (where < k < n), and every run descriptor a G type(^) we assign a 
set of important data values \Av(s k , o) C B. The idea is that although s k may contain many data values, a 
run having s k in its first configuration and described by a can use only some subset of these data values; 
these will be the important data values of s k (from the point of view of a). There is a small technical 
difficulty that a run can read (by accident) a data value already present in s k while performing a push 
operation; of course this is not what we want to count. To deal with this, we distinguish some data 
value G D and we define normalized runs: a run R is normalized when every of its non-£-transition 
performing a push reads data value 0. The sets idv are designed to satisfy the following lemma. 

Lemma 5.2. Let <k <n, let a = (m,r,L n ,L"- 1 , . . . ,L r+l ,q) G 9 k , let c = (p,s n : s"- 1 : • • • : s k ) be a 
configuration, and let d G O \ {0}. The following two conditions are equivalent: 

1. there exists a normalized r-return from c which agrees with o, and either reads d or ends in a 
stack t" : t"~ l such that d G id v(t', t) for some r+l <i<n and some T G 

2. type(^) contains a run descriptor a = (*1 / ' , , X F" _I , . . . , x i ,k+l ,p, a) such that *F C type(s') for 
k+ \ <i<n, and d G \dv(s k , o)ord£ \dv(s l ,t) for some k+l <i<n and some X G *F'. 

In the first condition we only say that there exists such run, moreover it can be different for every 
data value. However we will use this lemma only in situations when this run is in some sense unique, 
so that this unique run will be described. Observe also that a data value which was important in the first 
configuration of a run need not to be always read: it may also stay important in the last configuration of 
the run. We can exclude this possibility by taking empty sets L l (in particular by taking r = n). 

Now we come to the definition of types and important data values. We first define a composer, which 
is then (in Definition [53]) used to compose types of smaller stacks into types of greater stacks. 

Definition 5.3. We define when (<t> k ,<t> k 1 . . ., < i> l ; x ¥ k ) is a composer, where < I <k<n,<t>' C 2F l for 
l<i<k, and C 3T k . 

1. Let a = (I",!"- 1 ,. . . ,L l+ \p,a) G ST 1 , and let z = (I",!"" 1 ,. . . ,L k+1 ,p,a) G ST k (in particular 
a G $> k ). Then we say that (E*,E* _1 ,...,L' +1 ,{(t};{t}) is a composer. 

2. Tuple (0,0, ... ,0; {ne}) is a composer. 

3. Let C ST k , and for each % G x ¥ k let (O^^ 1 , . .. ,<I>' T ; {t}) be a composer. Then we say that 
(<j>*: ; <$> k . . . ,<t> l ; x ¥ k ) is a composer, where <t> 1 = U Te >f * ^\ f° r I <i<k. 

Now we will define types of 0-stacks, and sets of important data values as a fixpoint. 
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Definition 5.4. For each 0-stack s Q , let type(s°) be the smallest set satisfying the following conditions; 
additionally for a G type(5 , °), let idv(5 ,0 ,a) be the smallest set satisfying the following conditions. Let 
s Q = (a, d) be a 0-stack, and let p be a state. 

1. Assume that si has a transition (a,p,a,q\, pop k ) (where 1 < k < n and a G A U {e}), and that 
either a = e or d ^ no-data. We have two subcases: 

(a) Let 

Then a G type(.s ). Ifa^e, then d G idv(s°, a). 

(b) Let T = . . . , , ? i , (m, r, E", , . . . ,I7 +1 , ? )) G and 

<7 = ('P^>P"-^...,^ + ^{T},0,0,...,0,p,(<p(a)m,r,E^E"-^...,r +1 ^))G^ O . 

Then a Gtype(s°). Ifa^e, then Jg idv(j , <r). 

2. Assume that ^ has a transition (a,p,a,q\ , pusl/(j6)) (where 1 < £ < n). Let f° = Q3, no-data) if 
a = e, or = (p\0) otherwise. Assume that type(?°) contains a run descriptor 

r = (^^"-\...^\ qi ,( mi , n ,a'\a"- l ,...,^ + \ q2 )). 

Assume also that (<& c ,<& c ~ 1 , . . . ,<$>°; x ¥ k ) is a composer such that <$>° C type(/). We have two 
subcases: 

(a) Assume that r\ / k. Let 

o = (n",n"-\...,n\p,( ( p(a)m u r l ,a n ,a n -\...,a^\ q2 ))e^ , 

where 

' »F for£+l <i<n, 

IT = < <J>< for i = k, 

*FU<J>' for 1 <i<it-l. 

Then a G type(j°), and \J petS> o idv(s°,p) C idv(s°, a). 

(b) Assume that r\ = k and type^ ) contains a run descriptor 

X=(^\a n - l ...,Q. k+ \T k J k -\...j\ q2 ,(m 2 ,r 2 ,L^L n -\...X 2+ \q3)), 
where r 2 < k. Let 

<7 = (n«,n"- 1 ,...,n 1 , j p,(<p(a)m 1 m 2 ,r 2 ,E",E"- 1 ,...,E^ +1 ^ 3 ))e^ , 

where 

forifc+1 <?'<«, 
$'ur fori = it, 

tu^'ur forl<j<A:-l. 

Then a G type(j°), and \dv(s°,x) UU pe $o idv(j°,p) C idv(j°,<7). 
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Notice that the above operations are monotone (more run descriptors in types can only cause that 
more run descriptors are added to other types), thus the unique smallest fixpoint exists. Finally, we 
define types of greater stacks. 

Definition 5.5. For each /c-stack s , where 1 <k<n, let type(s k ) be the smallest set satisfying the fol- 
lowing conditions; additionally for a G type(/), let idv(/, a) be the smallest set satisfying the following 
conditions. 

Assume that s k = t k : t k ~ l , and there exists a composer (<$> k ,<t> k ~ l ;{o}) for which 4>* C 
type(^) and <t> k 1 C type^ 1 ). Then a G type(/), and U*=*-i Uped* idv(?',p) C idv(/, a). 

Let us now observe immediate consequences of Definitions [53] and [53] We see that ne is in the type 
of a &-stack s k if and only if k > 1 and s k is nonempty. We also see that a run descriptor a is in the type 
of a &-stack : s k ~ l : • ■ • : s 1 (where I < k) if and only if there exists a composer (O*,^ -1 ,...,<!>'; {a}) 
such that <!>' C type(s') for I < i < k (moreover <!>' has exactly one element). Similarly, d G \dv(s k : 
s k ~ 1 : • • • : s l , a) if and only if there exists a composer (O^,^ -1 , . . . ,<!>'; {a}) such that C type^') 
for / < i < k and that d G idv^', t) for some I < j <k and some T G <J> ; '. 

It is tedious but straightforward to prove Lemmas [5.1| and |5T2] directlv from the definitions of type and 
idv. For the 1=>2 implications, we decompose the run according to Proposition I4.2[ this tells us which 
rules of Definition 15 .4| imply that an appropriate run descriptor is in the type, and that the considered data 
value in in idv. For the opposite implication, a run descriptor is in the type (or a data value is in idv), 
because there is some derivation using the rules from Definition I5.4[ by composing all these rules we 
obtain an appropriate run. 

Let us mention that a similar notion of types was also present in [15]. Those types were defined 
in a different, semantical way. Namely, our Lemma 15.11 is used as a definition; then it is necessary to 
prove that the type of s k does not depend on the choice of s'\s n ~ l , . . . ,s k+l present in the assumptions of 
the lemma. This semantical definition is unsuitable for tracing important data values. Our approach is 
better for the following reason: when we add some run descriptor , . . . ,p, a) to the type 

of some &-stack, then in the sets we only have the assumptions which are really useful (we never put 
there redundant run descriptors). 

5.2 Upper runs 

For upper runs we do not define separate types; we will be using here the types for returns. Our goal is 
to obtain the following transfer property. 

Lemma 5.6. Let <k <n, and let R be a normalized k-upper run such that no other normalized run S 
from R(0) ends in the same state as R and satisfies (p(S) = (p(R)- Let s n : s n ~ l : • • • : s k be the stack or 
R(0), and t" : t n ~ x : • • • : t k the stack ofR(\R\). Let d,d' G B\ {0} be data values which are not read by 
R and which do not appear in s k , and such that for each k+l < i <n and each G G type(s') it holds 
d G idv(s',a) 43>d' G idv(s',a). Then d and d' do not appear in t k , and for each k+l <i<n and each 
a G type(f ! ) it holds d G idv(/', a) d' G idv(?', a). 

Intuitively, the lemma says that if two data values cannot be distinguished by the idv sets in the initial 
configuration of a &-upper run, they also cannot be distinguished in the final configuration. In order to 
obtain this lemma we define src sets as follows ("src" stands for "source")- Let < k < n, let R be a 
normalized &-upper run from stack s" : s n ~ l : • • • : s k to stack t" : t n ~ l : ■■■ :t k , and let £' C type(f') for 
k + \<i<n. For ifc + 1 < j <n we define a set src^/?,!",!"" 1 , . . . C type(^). To shorten the 

notation, we write src- 7 for srd (R,T. n ,L n ^ 1 , . . . ,T> ) (we should keep in mind that src 7 depends on R 
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and on E" , , . . . , L k+l ). The intuition is that d G idv(V , a) for some & + 1 < i < n and some a G £' 
if and only if d G idv(V, t) for some k+l < j <n and some z G src 7 . The actual statement is more 
complicated, and is given by the following lemma. 

Lemma 5.7. Let <k <n, let R be a normalized k-upper run from (p,s" : s n ~ l : ■■■ : s k ) to (q,t n : t n ~ l : 
■ ■ ■ :t k ), and let £' C type(V ) for k + 1 < i <n. Let also d G B \ {0} fee a ctaa va/we which does not 
appear in s k . 

1. Assume that d G idv(7', o) for some k+ \ <i<n and some O G £'. Then d G idv^, z) for some 
k+l < j <n and some % G src 7 . 

2. Let c = (p, u n : u n ~ l : • • • : u k ) be a configuration such that u k = s k and src' C type(w') for k + 1 < 
/ < n. Assume that d G idv(«- / , z)for some k+ I < j < n and some T G sro 7 (/?,£", \ . . . 

Then there exists a normalized k-upper run S from c to a configuration (q,v n : v" _1 : ••• : v*) 
such that (p(S) = (p(R), and v k = t , and £' C type(v') for k + 1 < i <n, and either S reads d or 
d G idv(V, (j) for some k + 1 <i<n and some a G E ! . 

Now we come to the definition of the src sets. We define them by induction on the length of R, and 
we make a case distinction on the form of R as in Proposition 14.31 

1 . Assume that R performs only operations of level at most k. Then we take src' = L' for k + 1 <i <n. 

2. Assume that \R\ = 1 and the operation performed by R is push r for r > k + 1. Then we define src' 
as the smallest sets satisfying the following: 

• X' C src' for k + 1 <i<n,i^ r, and 

• for every composer ( x ¥ r , x ¥ r ~ l , . . . , x ¥ k ; {o}) such that a G 57 and *F C type(V) for k < i < r 
it holds »F C src' for k + 1 < i < r. 

3. Assume that the first operation of R is push', and ^ti,|«| i s an ^-return, where r > k+l. Let 
p = ((p(R\ l \ R \) ,r,L n , . . . ,L r+ \q), where q is the state of R{\R\), and let s k be the topmost 
&-stack of R(l). We define src' as the smallest sets satisfying the following: 

• £' C src' for k + 1 <i<r, and 

• for every run descriptor p = (<£",<!>" ! , . . . jO^+^ijp) G type(^) such that <!>' C type(s') 
for k+ 1 < i < n, i / r, and O'' C type(s r : : ■ • • : it holds <!>' C src' for k + 1 <i <n, 
i 7^ r, and 

• for every run descriptor p = (<J>",<J>" _1 , . . . ,q\,p) G type(^) such that <!>' C type(5') for 
k+ \ <i <n, i^r, and <J> r C type(5 r : s r ~ l : • • • : s k ), and for every composer ( V F T , V P~ 1 , . . . , 

{A}) such that A G <I> r and »F'' C type(y') for k < i < r it holds W C src'' for £+ 1 < i < r. 

4. Assume that R = R\ oR 2 for shorter &-upper runs Ri,R 2 (if there are multiple choices of Ri,R 2 , 
we just fix any of them), and case 1 does not hold. The src' sets for R\ and for R 2 are already 
defined by induction. Denote <!>'' = src'' (R 2 ,L n X"- 1 , ... ,L k+l ) for k + 1 <i<n. We define src'' := 
src'^i,^,^- 1 ,...,^ 1 ) forit+1 <i<n. 

The proof of Lemma [5/7] is tedious but again follows easily from the definition. In the third case we 
use Lemma I5T21 Notice that point 1 of Lemma 15721 tells us only that a run exists, so we need a similar 
statement in point 2 of our lemma. We finish by a proof of Lemma [5761 

Proof of Lemma The part that d and d' do not appear in t k is immediate, as by definition of a &-upper 
run, t k is a (possibly modified) copy of s k . Because s k does not contain d nor d', and R is normalized, 
they cannot appear on this stack. 
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Assume that d € \dv(f, a) for some k+ 1 <i<n and some a G type(f'). Take £' = for fc+ 1 < / < n, 
I ^ i, and £' = {a}. By Lemma 15.7111 <i £ idv(s ; , t) for some k+ 1 < j < n and some T G src ; . By 
assumption also d! G idv(^-', t). Now we use Lemma l5.7l2l for configuration c = R(0). We obtain a run S, 
which ends in the same state as 7?, and such that <p(S) = (p(R); this means that R = S. The lemma implies 
that d' € \dv(t l ,o f ) for some k + 1 < / < n and some a' G As only £ ! is nonempty and contains only 
<T, we obtain <i' G idv(f', a). The opposite implication is symmetric. □ 

6 Why U Cannot Be Recognized? 

In this section we prove that language U cannot be recognized by a deterministic higher-order pushdown 
automaton with data of any level. We base on the techniques developed in the previous sections. 

Of course our proof goes by contradiction: assume that for some n we have an (n — l)-HOPAD 
recognizing U. We construct an «-HOPAD which works as follows. First it makes a push" operation. 
Then it simulates the (n — l)-HOPAD (not using the push" and pop" operations). When the (n — 1)- 
HOPAD is going to accept, srf makes the pop" operation and afterwards accepts. Clearly, srf recognizes 
U as well. Such normalization allows us to use Lemmas 15-11 and [5721 as in srf every accepting run is an 
^-return. Moreover, we assume that to each state lead either only e-transitions, or only non-£-transitions 
(it can be easily achieved by having two copies of each state). 

As already mentioned, let q> : A* — > M be a morphism into a finite monoid which tests whether the 
word is empty, or is a single closing bracket, or begins by a dollar, or is any other word. This morphism 
is used with the definition of types and important data values in the previous section. 

Let N = 2 + 2 M , where M = £" =0 \^'\- Consider the following words: 

w = [][ w k+{ =w N k ] N [ forO<£<rc-l, 

where the number in the superscript (in this case N) denotes the number of repetitions of a word. Let w n 
be a data word in which every data value is different and is not equal to 0, and such that 7t(w n ) = w n . 

Let R be the unique run from the initial configuration which reads w n , and ends just after reading its 
last letter. Abusing slightly the notation, for any prefixes v, w of Wk, we define R(y) to be the configuration 
of 7? just after reading |v| letters, and R(v,w) to be the subrun of R between R(y) and R(w). It will be 
important to analyze relations between these configurations R(y) for different prefixes. 

Notice that every data value read by 7? appears somewhere in a stack of R(v). This is true, because 
after reading [$, the automaton has to check in particular whether the data values present later on the 
input are the same as those already read by R. However this contradicts with the following key lemma 
(taken for k = n and u = e). 

Lemma 6.1. Let < k <n, and let u be a word such that uwk is a prefix ofw n . Then some data value 
read by R(u, uwk) does not appear in the topmost k-stack ofR(uwk)- 

Proof. The proof is by induction on k. For k = this is obvious, as three data values are read, but the 
topmost 0-stack can store only one of them. 

Let now k > 1. Assume first that for some nonempty prefix v of wu the run R(uv, uwt) is not (k — 1)- 
upper. Let us look at the data value read by the last operation before R(uv). This data value appears 
only once on the stack of R(uv), in the topmost 0-stack. One possibility is that R(uv, uwk) is not £-upper. 
By definition this means that the topmost &-stack of R(uwk) is not a copy of the topmost /c-stack of 
R{uv), thus it cannot contain this data value. The opposite possibility is that R(uv,uwt) is &-upper, but 
not (k — 1) -upper. This means that the topmost &-stack of R(uwk) is obtained as a copy of the topmost 
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&-stack of R(uv), however the topmost (k — l)-stack of this &-stack is not a copy of the topmost (k — 1)- 
stack of R(uv). The only way how this could happen is that all copies of the topmost (k — l)-stack of 
R(uv) were removed from this &-stack Then also this /c-stack does not contain the considered data value. 

For the rest of the proof assume the opposite: that for each nonempty prefix v of wt the run R(uv, uwu) 
is (k — 1) -upper. We will obtain a contradiction, which will prove that this is in fact impossible. From 
this assumption we get the following property Z>. 

Let v' be a prefix of Wk, and v a nonempty prefix of V . Then R(uv, uv') is (k — l)-upper. 

Indeed, if the topmost {k — 1) -stack of R(uwk) is a copy of the topmost {k — 1) -stack of R{uv') and of the 
topmost (k — l)-stack of R(uv), then for sure the topmost (k — l)-stack of R(uv') is a copy of the topmost 
(k- 1) -stack of R(uv). 

From the induction assumption (where uW^\ is taken as u), for each 2 < i < N there exists a data 
value di read by R^w'^^uw 1 ^) which does not appear in the topmost (k — l)-stack of R^w 1 ^). As 
R(uw' k _ l ,uw^_ l ) is (k — l)-upper (property 'v'), we know that dj does not appear in the topmost (k — 1)- 
stack of R(uwf _ { ) as well. 

Denote the stack of R(uw^_ l ) as s" : s"~ l : ••• : Since there are more possible indices i G 

{2,3, . . . ,N} than subsets of U'^typeis'), there have to exist two indices 2 < x < y < N such that for 
each k < i <n and each X G type(s ! ) it holds d x G idv(V,T) 44> d y G \dv(s l ,x). Let r = N — x + 1. Let 
us consider the unique accepting run 5 from R{uw 1 ^_ l ] r ) whose first read letter is a dollar (this run is 
unique because the U language determines the rest of the word after a dollar). Notice that the last 
opening bracket which is not closed before the dollar is the last bracket of the .x-th wt-i after u. Thus 
by definition of U, S reads d x and does not read d y . This contradicts with the thesis of the next lemma 
(where A: — 1 is used as k). □ 

Lemma 6.2. Let <k <n, and let u be a word and r a number such that u] r is a prefix ofw n . Assume 
that R(u]'~ l ,u]') is k-upper for each 1 <i < r. Denote the stack of R(u) as s" : s n ~ l : • • • : s k . Let also 
d,d' be data values which are not read by R(u,u] r ), which does not appear in s k , and such that for each 
k+1 <i <n and each T G type(s') it holds d G idv(V , t) d' G idv(s ! , t). Then the unique accepting 
run from R(u] r ) whose first read letter is a dollar, either reads both d and d', or none of them. 

Proof. Originally we have assumed that every data value read by R is different. Let us now make a small 
twist, and assume that the data values appearing with the r closing brackets after u are all equal to 0. 
This does not affect the statement of the lemma, as the changed data values does not appear in the stack 
of R(u) nor these are d or d'. Thanks to this change, the runs R(u] ,u]') are normalized for 1 < i < r. 
Notice also that R(u] ,u]') is the only normalized run from R(u] ) which ends in the same state as 
R(u]'~ l ,u] 1 }, and reads a single closing bracket (recall that equality of states causes that such a run ends 
just after reading a letter). Thus we can apply Lemma [531 for the run R{u] , u]'} and for data values d, d' . 
Denote the stack of R(u] r ) as t" :t n ~ l After applying this lemma consecutively for i = 1,2,. ..,r, 

we obtain that d,d' does not appear in t k , and that for each k + 1 < i < n and each x G type(?') it holds 
d G idv(?', x) 4» d' G idv(f ! ', x). 

Now consider the unique accepting run from R(u] r ) whose first read letter is a dollar. We see that it is 
normalized (it does not read any new data values), and that this is an ^-return (as every accepting run of 
g/). It agrees with o = ((p(S),n,q) for some accepting state q. Lemma l5Tl implies that type(t k ) contains 
a run descriptor a = (<J>",<I>" _1 , . . . ,<t> k+l ,p, a) such that <t> 1 C type(?') for k + 1 < i < n. Assume that 
S reads one of d,d', say d. Then implication 1=^2 of Lemma \52\ says that d G \dv(t',x) for some 
k+1 < i <n and some x G (the lemma allows also that d G idv(?^, a), but this is impossible as d 
does not appear in t k ). However in such case also d' G idv(? ! , t). Using now the opposite implication of 
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Lemma [5T21 we obtain a normalized run 5" from 5(0) which agrees with a and reads d' . But 5' is also 
accepting and its first letter read is a dollar, so 5' = 5. □ 
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